PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG008214t3
Common NameTCM_008214
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HB-other
Protein Properties Length: 1782aa    MW: 199940 Da    PI: 5.1436
Description HB-other family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG008214t3genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox597.8e-192882357
                      --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
          Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57
                      kR   t++qle+Le+ +++++yps+++r+eL+ +lgL++rq ++WF+ rR k++k
  Thecc1EG008214t3 28 KRKMKTASQLEILEKTYAMEMYPSEATRAELSVQLGLSDRQLQMWFCHRRLKDRK 82
                      89999************************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.603.7E-18982IPR009057Homeodomain-like
SuperFamilySSF466893.51E-161582IPR009057Homeodomain-like
PROSITE profilePS5007116.5032383IPR001356Homeobox domain
SMARTSM003897.3E-172587IPR001356Homeobox domain
CDDcd000863.04E-142682No hitNo description
PfamPF000462.1E-162882IPR001356Homeobox domain
PROSITE profilePS5082716.82551610IPR018501DDT domain
SMARTSM005714.6E-22551610IPR018501DDT domain
PfamPF027912.4E-16552607IPR018501DDT domain
PfamPF050661.5E-15733801IPR007759HB1/Asxl, restriction endonuclease HTH domain
PfamPF156121.1E-7952993IPR028942WHIM1 domain
PfamPF156131.4E-1311341206IPR028941WHIM2 domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0010228Biological Processvegetative to reproductive phase transition of meristem
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1782 aa     Download sequence    Send to blast
MDSGGVSGGG GSSEGEKKKP PEGETKVKRK MKTASQLEIL EKTYAMEMYP SEATRAELSV  60
QLGLSDRQLQ MWFCHRRLKD RKAPPVKRRR KDSSLPAQVV GVAGEEMGGG EAENEHGSDV  120
SSLFGPGLHL RRAVPIPGMA VPRYYEMTHS MAELELRAIT FVELQLGEPI RDDGPMLGME  180
FDPLPPGAFG APIVGASTAV QQKQPGQPFE TKIYERLDTK AVKGSVRAVH EYQFLPEQPS  240
VRTETYERVA LSYHYGSPTD DPHARASSLS TGCSFVHGNE KVPSGYGFSG QMPNLNLLPQ  300
QSRQGHLLPT ASGEYDNCSR KNSLTNTTVD AIIGAHPISA LESPFVSSDR RVNLDEDALR  360
MERKRKSEEA RIAREVEAHE KRIRKELEKQ DILRRKREEQ IRKEMERHDR ERRKEEERLL  420
REKQREEERY QREQRRELER REKFLMKESI RAERMRQKEE LRKEKEAARL KAANERAIAR  480
KLAKESMELI EDERLELMEL AASSKGLSST LSLDFEILQN LDIFRDKLCV FPPKGVQLKR  540
SFSIEPWNSS EESIGNLLMV WRFLITFADV VGLWPFTLDE LVQAFHDYDP RLLGEIHVAL  600
LRSIIKDIED VARTPSTGLG ASQNNAANPG GGHLQIVEGA YAWGFDIRSW QGHLNMLTWP  660
EILRQFALSA GFGPQLKKRN IEQAYLRDEN EGNDGEDIIT NLRNGAAAEN AVAIMQERGF  720
SNPRRSRHRL TPGTVKFAAF HVLSLEDSDG LTILEVAEKI QKSGLRDLTT SKTPEASIAA  780
ALSRDTKLFE RTAPSTYCVR SPYRKDPADA EAILSAARER IRVLKSGFVG EDAEGAERDE  840
DSESDIAEDL EVDDLGAEIN PKKEMLNSEG SSSCDAKTIL GNEKEICEIL ETPQGEVRNV  900
CKALSSPTAG GLDEVKYIDA PVEQSMDAAG ICNGAANAGL EDTEIDESKL GEPWVQGLME  960
GDYSDLSVEE RLNALIALIS IAIEGNSIRV VLEERLEAAN ALKKQMWAEA QLDKRRMKEE  1020
FVLRTNFSSH MGNKMEPSLM MSSAECRQSP QIISDRKNNE SSVDLVVQQE CLNNPQNDQN  1080
YLNNVPSEGN MPIQDFSIGP DNLQYPQPGC AAERSRSQLK SYIGHKAEEM YVYRSLPLGQ  1140
DRRHNRYWRF ITSASWNDPG CGRIFVELLD GRWRLIDTEE GFDTLLSSLD VRGVRESHLH  1200
AMLQKIEMSF KEAVRRNKLH VNMERQNGDT IKKEANEMAS GPDWNVSFES PSSTVSGSDS  1260
DMSETSTSFS IELCRNEIEK NDALKRYRDF EKWMWKECFS LSSFCATKYG RRRCKQLLGV  1320
CDSCFNIYFF EDNHCPSCHR TDIASRSMLN FSEHVAQCAK KLQLGPGFAL DGLVISPLRI  1380
RLTKLQLALV EVSIPFEALQ SAWTEGYRNF WGMKLYSSTT AEELLQVLTL LESSITRDYL  1440
SSNFETTREL LSPSILSGGV GDDSTNLETV PVLPWIPKTT AAVALRLIEF DAAISYTLKQ  1500
RAETHKGAGE CMFPSKDAVV KNNQDHERMQ TTNRVEYLQE ASWVDVGIGF SGSGRGRGRG  1560
RGRGVTRGGR SQRRPTGSRS EFGKRITTTD NEGLVPVLGW KSRSRGRGGR KRGRRSARSR  1620
PKPAKRMVEI AGERENPKEI MEKSSRNLAT NTWNGDEVTR LKVRTADNAS SSERSEYNDE  1680
NGQATGDEYD YLAGEDYAGG FNGKADDVME GSEYNIDGDE DDDGEERDDI AEGEQGNFIV  1740
GGYINENSDE EEIRNGDDPE DSDPYVKQYG YSTEASSDFS E*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
115541563RGRGRGRGRG
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007043693.10.0Homeodomain-like transcriptional regulator, putative isoform 3
SwissprotQ9FFH10.0RLT2_ARATH; Homeobox-DDT domain protein RLT2
TrEMBLA0A061E3A90.0A0A061E3A9_THECC; Homeodomain-like transcriptional regulator, putative isoform 3
STRINGPOPTR_0017s04760.10.0(Populus trichocarpa)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G44180.10.0Homeodomain-like transcriptional regulator
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]